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Abstract 


The square of the horizontal projected (rms) beam emittance is expressed as the sum of four 
nonnegative contributions, each described using the slice moments of the beam and possessing a 
natural interpretation in terms of the geometrical properties of the beam in the six-dimensional 
phase space. The mathematical formalism describing the relationships between projected beam 
quantities and slice beam quantities is reviewed. The results may be used to reconstruct the 
moments and emittances of the beam from the moments of its subpopulations, as well as to 
isolate and better understand a variety of slice and interslice dynamical contributions to the 
projected beam emittance growth. 


1 Introduction 


The concepts of slice and projected beam emittance play a central role in the design and optimiza¬ 
tion of next-generation FEL light sources [T], in the theory of emittance compensation m-m, and 
in the general theory of beam dynamics in linear and circular accelerators [1]. The slice formalism 
is particularly useful in the case of long beams with small uncorrelated energy spread, for which 
the longitudinal slice-to-slice variation in the fields experienced by particles in the beam (e.g., due 
to Wakefields and time-dependent RF fields) can be significant, while there is little particle slippage 
between slices. In this case, the dynamics of particles within each slice may profitably be separated 
from the dynamics of the slice centroids. 

This separation leads naturally to the question of the relationship between the projected (rms) 
emittance and the properties of the various beam slices. We demonstrate that the square of the 
projected emittance can be decomposed as the sum of four contributions, each with a distinct 
geometrical interpretation in terms of the slice beam moments. Fragments of a treatment along 
these lines can be found in various forms in the literature 0-19]. The purpose of this paper is to 
consolidate these results in the form of a general and carefully presented mathematical treatment, 
with proofs of some nontrivial features included. Several tools are presented that point the way 
toward a similar treatment of other diagnostic beam quantities such as the projected beam energy 
spread, the 4D and 6D emittances, and the eigenemittances for coupled beams m- 
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2 Basic formalism 


Let / denote a beam distribution function on the 6D phase space, which we parameterize by 
canonical coordinates X = {x,px,y,Py,z,pz)- For convenience, we will assume that all momenta 
are normalized by the quantity me. By assumption, / has the properties that / > 0 and 

I f{X)dX = 1. (1) 

Thus, / is a joint probability density in the six random variables x, Px, y, Py, z, and pz that 
characterizes the distribution of the beam particles. If (j) is any function defined on the phase space, 
we define its beam average in the natural way: 

(</.) = J <P{X)f{X)dX. (2) 

Assume that z is the direction of beam motion, and consider the planes of constant z, which 
we describe as longitudinal slices. Let ( = {x,px,y,Py,Pz) denote the remaining phase space 
coordinates, so we will write X = {z,Q when we wish to emphasize the separation of the slice 
coordinate z from the coordinates within a slice C- There are two natural functions associated with 
this decomposition into slices. First, the longitudinal density profile of the beam is given by the 
following marginal probability density; 

Hz) = J f{z,C)dC- (3) 

Second, the slice distribution function for the slice z is given by the conditional probability density: 

" r ^ provided A(z) > 0, (4) 

J f{z,()dC Hz) 

with f{C\z) = 0 when A(z) = 0. It follows that both of these quantities are nonnegative and 
normalized so that: 

J X{z)dz = 1, J f{C\z)dC = 1. (5) 

Given any function dehned on the phase space, we may dehne its average within slice z by using 
the slice distribution function: 

{Hz = J Hz,C)f{C\z)dC. (6) 

We use the notation /|z to refer to the the slice distribution function when we wish to avoid listing 
its arguments, in order to distinguish this quantity from the full beam distribution function /. 


3 Slice and projected moments 


We use the notation jl and S to denote the centroid vector and covariance matrix of the beam, 
respectively, whose elements are given by: 

Ma = (Xa), T.ab = {{Xa - Pa){Xb - Pb)) for 0,6=1,..., 6. (7) 
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Likewise, define the slice centroid vector /2(z) and slice covariance matrix S(z) for slice 2 : by: 


= {Xa) T.ab{z) = {{Xa - fia{z)){Xb - Hb{z)))^ for 0=1,..., 6. (8) 

Note that we have used two distinct forms of averaging () and ()^, associated with the probability 
densities / and f\z. Given a quantity cj) that depends on 2 , we may also define an average across 
slices by using the longitudinal density profile A: 

E[(/)] = J (f){z)X{z)dz. (9) 

Likewise, given two 2 —dependent quantities (/>i and 02, we define their covariance in analogy with 
d?]) and dSI): 

Cov[ 0 l, 02 ] = E [(01 - E[ 0 i ])(02 - E[ 02 ])], ( 10 ) 

and the variance is defined in the usual way as: 

Var[0] = Cov[0,0] = E[(0 - E[0])2]. (11) 

The projected ([7|) and slice Q moments of the beam can now be related using two fundamental 
results from the theory of conditional probability. The law of total expectation gives: 

fia = HlJ-a{z)] for a = l,...,6, (12) 

and the law of total covariance gives: 

Safe = E[Safe( 2 ;)] + Cov[//a( 2 ;),//fe(z)] for a, 6 = 1,..., 6. (13) 

Note that the notation in (|12til3p makes sense, because the slice moments appearing within the 
brackets [•] on each right-hand side are 2 —dependent quantities. A proof of each of these claims is 
provided in Appendix A. 

The projected (rms) normalized emittance of the beam in the {x,px) plane is typically defined by: 

= ((x - {x)f){{px - {p.)f) - {{x - {x)){px - {Px)))\ (14) 

with expressions of the same form for and e^. It is convenient to express these definitions using 
the 2x2 blocks of the 6D covariance matrix associated with each degree of freedom: 

yX _ yY _ f^33 ^340 z _ f^55 

“ ^5^21 ^ 22 ] ~ VS 43 ^ 44 ; “ V^65 See 

so that: 

e^ = detS^, Cy = detTr^, e2 = detS^. (16) 

In the same way, we may express the emittances within slice z as: 

6 ^( 2 ) = det ^^(z), Cy = detTi^ (z), el = detT,^ (z), (17) 

where Ti^{z), T?' {z), and Xi^(z) are obtained from the slice covariance matrix ^(z) dehned in ([8]). 
It follows from our definitions that e^lz) = 0 for all z, so this quantity will play no role in the 
discussions to follow. 



3 


4 Decomposition of the projected emittance 


In this section we will show that the horizontal projected emittance (|14|1 can be written as the 
sum in quadrature of four contributions, each with a natural interpretation in terms of the slice 
properties of the beam. Corresponding results apply also for the projected emittances in the other 
two planes. 


Define a 6x6 matrix A according to: 


Kab = Co\[iXa{z),^Xb{z)], for a, 6 = 1 ,..., 6 
and likewise define the 2x2 blocks of A associated with each degree of freedom: 

\X ^ (^11 ^ 12 \ = (^33 ^ 34 \ = (^55 

VA21 A22y VA43 VAes Ase^- 

Then it follows from (I13p that we have the 2x2 matrix identities: 

= E[S^(z)]+A^, = E[S^(z)]+A^, = E[S^(z)] + A^. 

Therefore according to (I16p the horizontal projected emittance is given by: 

el = det (E[S^(z)] + A^) . 


(18) 


(19) 


( 20 ) 


( 21 ) 


We exploit the following useful identity that applies for any sum of 2x2 matrices, which the reader 
may directly verify: 


det(A + B) = det(A) + det(i3) + tr(A) tr{B) — ii{AB). (22) 


This gives: 

el = det(E[S'’‘‘(z)]) + det(A^) + tr(E[S^( 2 ;)]) tr(A^) — tr(E[S^( 2 ;)]A'’‘‘) 

= det(E[S'^( 2 :)]) + efjjt + e^, (23) 

where we have defined: 

e^=detA^, (24) 

eL = tr(E[S^(z)])tr(A^) - tr(E[S^(z)]A^). (25) 

The significance of these quantities will soon become apparent. We further decompose the first 
term appearing on the right-hand side of (|23p as follows: 

det(E[S^(z)]) = E[Sn(^)] E[S 22 (^)] - n'^i 2 {z)f 

= E[Sn(^)E 22 ( 2 )] - Cov[Sn(z), S 22 (^)] + Var[Si 2 (z)] - E[^i 2 {z)^] 

= E[e^( 2 ;)] - Cov[Sii(z), S 22 (^;)] + Var[Si 2 (z)] 

= E[ex{z)f + Var[ea;( 2 ;)] - Cov[Sii(z), ^ 22 ( 2 :)] + Var[Si 2 ( 2 ;)] 

= ( 26 ) 
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where: 


ei = E[ea;(z)]^, 

= \sj[ea:{z)] - Cov[Sii( 2 :), T. 22 {z)] + Var[Si 2 (^)]. 


(27) 

(28) 


Thus, we have shown that: 

+ ^int + ( 29 ) 

In Appendix B, we prove that each of these four contributions is nonnegative, so that e_L, sr, eint 
and e|| are always real quantities. 

Changing to a more intuitive notation, let us write the first moments of slice z in the form: 


fJ-x{z) = fii{z) ^ip^{z) = ii2{z), (30) 

which denote the horizontal position and momentum of the slice centroid. Similarly, let us write 
the second moments of the slice z in the form: 

= ^n{z), o-pjz) = ^ 22 ( 2 :), {AxAp^)^ = T.i2{z). (31) 

Expressing the four emittance contributions in terms of these quantities gives, after writing out 
(j24ll25j) explicitly: 

ej_ = 'Ej[ex{z)]‘^, (32a) 

4 = Var[ea;(z)] - CoY[al{z), a^Jz)] + Var[(AxAp,,) J, (32b) 

= H(^l{z)]yai[np^{z)]+E[alJz)]Yar:[nx{z)] - 2E[{xpx)^]Cov[nxiz), Pp^iz)], (32c) 
ej = Var[^a,( 2 ;)] Var[//p^( 2 ;)] - Coy[px{z), Pp,{z)]^■ (32d) 

Let us consider each of these contributions in turn. 


1. The quantity 


e_L = E[ej;(2;)] 


is the mean slice emittance. It is determined by computing the emittance within each longitudinal 
beam slice, and then taking the mean among slices, with each slice weighted by its corresponding 
longitudinal density. It vanishes if and only if every slice of the beam has zero emittance. 


2. The quantity €r is the mismatch emittance, and its value is determined by the size of variation 
in the beam second moments (the covariance matrices E^{z)) from slice to slice. When ex{z) 7 ^ 0, 
it is typical to define the slice Twiss functions a{z), /3{z), and 7 ( 2 ;) such tha 10 : 

E^{z) = ex{z) with / 3 (z) 7 (z) - a{zf = 1. (33) 

If the slice Twiss functions a(z), P{z), and 'y(z) are identical for every slice 2 ;, then cr vanishes. 

^Each mention of the Twiss functions in this work refers to the horizontal Twiss functions, so the subscripts in 
Oix, Px, and have been suppressed; there is no possiblity of confusion with the relativistic factors /3 and 7 , since 
these appear only once in (I 66 II . Since the emittance in (1331) is normalized, the Twiss functions here differ from the 
geometric Twiss functions by the appropriate relativistic factor. 
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3. The quantity eint is the linear misalignment emittance, and its value is determined by the size 
of variation in the first moments (slice centroids) from slice to slice. This is a “cross term”, as the 
contributions from the centroid variations are themselves weighted by the mean second moments. 
If the centroids of all beam slices are aligned, so that gLx{z) and fap^{z) are identical for every slice, 
then Cint vanishes. 

4. The quantity e|| is the nonlinear misalignment emittance, and its value is also determined by 
the size of variation in the first moments (slice centroids) from slice to slice. Unlike Cint, it depends 
exclusively on the slice centroid coordinates. In particular, e|| is the beam emittance that would 
result if all charge within each slice were concentrated at the location of the corresponding slice 
centroid. If all slice centroids are aligned, then e|| vanishes. 

Appendix B describes the conditions under which each of these contributions vanishes. The reason 
for separating the terms Cint and e|| will become more clear in the following section. 


5 Special cases 


The decomposition of the previous section is valid for a general beam distribution. However, there 
are several idealized special cases that arise frequently in the literature. We will consider four of 
these in this section, with the hope that this will clarify the significance of the four contributions 
given in ([32]) . 


5.1 Uniform slice emittance 

First, suppose that the emittance of every longitudinal beam slice is identical. More precisely, 
assume that: 

e^iz) = £[€ 3 ,( 2 ;)] = ex (34) 

for every z with A(z) 7 ^ 0. Provided ex ^ 0, it is natural to define the slice Twiss functions a{z), 
I3{z), and 'y(z) as in (f33]l . so that 

= e±l3{z), a^Jz) = e_ij{z), {AxAp^) ^ =-e_ia{z). (35) 

In this case the contribution ex is given simply by p4l) . and the contribution e|| is unchanged from 
its original form (|32|) . We can re-express the remaining two contributions in terms of the Twiss 
functions as: 

4 = ei {- Cov[/3(z), 7 (z)] -|- Var[a(z)]} , (36a) 

efnt = e±{E[/3(^)]Var[^pJz)] -bE[ 7 (z)]Var[/ra;(z)] + 2E[a(z)]Cov[/i^(z),/ip^(z)]}. (36b) 

Here we have used the facts that for any constant a and any z—dependent functions (f), 4)i, and (j) 2 - 
E[a(/>] = aE[(/)], Cov[a(/>i, 0 ^ 2 ] = Cov[(/)i, ^ 2 ], Var[a(?()] = a^Var[(/>]. (37) 

Also, we used the fact that due to the assumption in (l34|l : 

Var[e,(z)] = E[(e,(z) - E[e,(z)])2] = 0. (38) 
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The expression (I36ap indicates that the mismatch emittance ept is determined by the size of slice-to- 
slice variations in the beam Twiss functions. However, it is convenient (and somewhat surprising) 
that cr can also be written directly in terms of the mean Twiss functions. This can be seen by 
noting from (j33[) that: 

1 = E[/ 3 (z) 7 (z) - a‘^(z)] = E[/ 3 ( 2 ;) 7 (z)] - ^[a^{z)] (39) 

= Cov[/ 3 ( 2 ;), 7 (z)] + E[^( 2 ;)] E[ 7 ( 2 ;)] - Var[a(z)] - E[a(z)]^ (40) 

so: 

4 = 4 {E[/3(z)] E[ 7 (z)] - E[a(z)]2 - l} . (41) 

When I3{z), 7 ( 2 :), and a(z) are each independent of z, we see that this quantity indeed vanishes as 
a consequence of (f33]) . 


5.2 Uniform slice Twiss functions 

Suppose that the Twiss functions of every longitudinal beam slice are identical. More precisely, 
assume that ex{z) 7 ^ 0 and: 

a{z) = E[a{z)], P{z) = E[^(z)], 'y{z) = E[ 7 (z)], (42) 

for every z with A( 2 ;) 7 ^ 0. We let oq, /3o, 70 denote the uniform values given in (f42]l . The mismatch 
emittance in ()32p then becomes: 

e| = Var[ea,(2;)] - Cov[ea;(z)/3o, ix{z)-io] + Var[-ea,(2;)ao] 

= Var[ 63 ,( 2 ;)] -/ 3 o 7 oCov[e 3 ;( 2 ;),€ 3 ,( 2 ;)] + Oq Var[ 63 ,( 2 :)] 

= Var[ 63 ,( 2 ;)] - (/3o7o - 4) Var[ 6 a,( 2 ;)] = 0, (43) 

since /3o7o “ ~ Thus, the misalignment emittance vanishes. The contributions 6 _l and 6 || 

are independent of the slice second moments, and their form remains unchanged. The remaining 
contribution becomes: 

4 t = E[63,(2;)/3 o] Var[/rpj 2 ;)] + E[63,(2;)7o] Var[/i3,(2;)] - 2 E[-63,(2:)ao] Goy[hx{z), ^ip^{z)\ 

= {/3o Var[/rp^( 2 ;)] + 70 Var[/r 3 ,(z)] + 2ao Cav\iJ,x{z), ^lp^{z)]} . (44) 

5.3 Aligned slice centroids 

Next, suppose that all slice centroids are aligned. More precisely, assume that: 

Hx{z) = E[/r 3 ,( 2 ;)], Hp^{z) = E[/ip^( 2 ;)], (45) 

for every 2 ; with A( 2 ;) 7 ^ 0. It immediately follows that: 

\aI[^lx{z)] = Yai[np^{z)] = Cov[^lx{z), Hp^{z)] = 0, (46) 

and so 6 int = e|| = 0. Thus, the only projected emittance contributions are due to the mean slice 
emittance and the mismatch emittance. 
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(47) 


This case is often treated by introducing a slice mismatch parameter [m-na, given by: 

C( 2 ;) = ^ {'y{z)/3 - 2a{z)a + / 3 (z) 7 } , 
where a, /3, and 7 are the projected Twiss functions, defined by: 




f3 —a 
-a 7 


with 13'y — = 1 , 


(48) 


and Cx is the projected emittance. An alternative expression is given in terms of the projected and 
slice Twiss matrices, given respectively by: 


R = 




R{z) 


f (3{z) -a( 2 :)\ 

V-«(^) liz) ) ’ 


from which ()47p takes the form: 

C{z) = ^tT:{R-^R{z)). 


(49) 


(50) 


The slice mismatch parameter has the properties that ^(- 2 ^) > 1; and Ciz) = 1 if and only if 
R{z) = R. (A proof is included in Appendix C.) As a result, C{z) is a measure of how far the Twiss 
functions of slice z vary from the projected Twiss functions of the beam. 


Multiplying (|47p by the slice emittance at z reveals that: 

€x(z)C(z) = ^ {'S22iz)/3 + 2 Si 2 (z)a + Sii(z) 7 } . (51) 

Applying (f46j) to (fl^ . we see that in the present case the projected and slice covariance matrices 
are related by: 

T^ab = R[^ab{z)], a, 6 = 1,..., 6. (52) 

Taking the average of (ISTIl across slices using (l5^ gives immediately the projected emittance: 

E[€xiz)C{z)] = ^ {S 22/3 + 2Ei2a + S 117 } = ex {/3'y - a^} = e^- (53) 

It therefore follows that: 

el = E[ex{z)az)f-E[e.{z)?. (54) 

It is clear from (1541) that when ^(z) = 1 for every slice (the perfectly matched case), then cr = 0. 


5.4 Linear slice centroid misalignment 


For the final case, assume a purely linear misalignment of the beam centroids. More precisely, 
assume that: 


tlx{z)= Hx{Zc) + {z- Zc)ti'x, ^p^(z) = /ip^(Zc) + (z - Zc)/ip^, (55) 

for slices with A(z) 7 ^ 0. Here Zc = E[z] denotes the longitudinal location of the beam centroid, and 
/i(,, are constants. This is a case that occurs regularly in linear optics systems with nonzero 
transverse-longitudinal coupling. 


The quantities €± and cr are independent of the slice centroid values, and their form will therefore 
remain unchanged. To evaluate the remaining two emittance contributions, note from ([55]) that: 


\av[nx{z)] = , Var[/Xp^(z)] = Var[2;](^p^)^ Cov[i^^{z), Hp^{z)] = Var[z]/i^/ip^. 

(56) 

An immediate consequence is that the nonlinear misalignment emittance vanishes: 

4 = Var[/ia:(^;)] Var[//p^(z)] - Cov[ij,^{z), iJ.p^{z)f = 0. (57) 

Evaluating the linear misalignment emittance gives: 

4nt = + -2E[{xp^)Jn'^^'pJ . (58) 


To interpret ([581) . it is helpful to re-express the quantities involved using the projected beam mo¬ 
ments. To determine and Pp^, for example, take the covariance of (I55p with the slice coordinate 
2 ; to see that: 


Cov[ 2 :, px{z)] = Cov[ 2 ;, Px{zc)] + Cov[ 2 :, 2 ; - Zc]lJ.'x = YaT[z]fi'^, (59) 

with a corresponding expression involving However, it also follows from ()13[) that: 


since for each slice z 

Thus we have: 


S55 = E[S55(2;)] -h Cov[n5{z), P5{z)] = Var[2;], 

S51 =E[S5i(2:)] +Cov[p 5 {z),fii{z)] = Cov[z,Hxiz)], 

^55{z) = 0, S 5 i(z) = 0, H5{z) = z. 

S 52 


/ ^51 

^55 


= 


S 55 


(60) 

(61) 

(62) 

(63) 


These quantities jointly characterize the size of transverse-longitudinal correlations in the beam. 
Likewise, applying (fT^ together with (l56l) gives: 


Ell = E[Sii(2;)] -F Cov[ni{z), ni{z)] = E[cr^(2;)] +YaT[z]{n'^f, ( 64 a) 

S22 = E[E22(2:)] + Coy[p2(.z), P2{z)] = E[ap^{z)] +YaT[z]{fipj‘^, ( 64 b) 

Ei2 = E[Ei 2(2:)] -h Cov[fii(z), iJ,2iz)] = E[{AxApx)J +YaT[z]iJ,'^Pp^. ( 64 c) 

Using (IMj) and (1631) to re-express ([581) in terms of the projected beam covariance matrix gives: 

^hit = — (^11^52 + — 2S12E51S52) . ( 65 ) 

E55 


Note that this quantity is defined only when E 55 = 7 ^ 0. This is reasonable, since when Uz = 0, 

the decomposition of the beam into longitudinal slices is no longer meaningful. 
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6 Generalizations and Related Results 


The decomposition given in (|32|) can be generalized in a number of ways. First, note that the 
treatment leading up to ([32]l is independent of the phase space variables used, and the same result 
may be applied if the conjugate canonical variables {z,pz) are replaced by the conjugate variables 
{t,pt) or {—ct,5), as are typically used to describe the beam distribution at a fixed longitudinal 
location, where for example: 

pt = 5 = A 7 / 70 . ( 66 ) 

In this case, the longitudinal beam slices are defined by the temporal coordinate t (or —ct) rather 
than by the spatial coordinate z. Alternatively, one may decompose the beam into slices based 
on the horizontal coordinate x or the vertical coordinate y. The following two cases are worth 
describing in some detail. 


6.1 Emittance decomposition using 3-D spatial coordinates 

The discussion of Section 2 can be modified by replacing the decomposition of the beam into 
longitudinal slices (using the longitudinal coordinate z) with a decomposition of the beam into 
spatial points (using the three spatial coordinates x, y, and z). To see how this is done, we write 
X = (r,p) to emphasize the separation of the spatial r = {x,y,z) and momentum p = {px,Py,Pz) 
phase space coordinates. 

As in Section 2, the beam distribution function / defines a probability density on the 6-D phase 
space, and it is natural to define both a marginal and a conditional probability density associated 
with the decomposition into spatial points. In place of the marginal probability density given in 
([3|), we now have the 3-D spatial density: 

/ 5 (r) =//(r,p)dp. ( 67 ) 

Likewise, in place of the conditional probability density given in ([3]), we now have the distribution 
function at a point r: 

/(pW = ^. CSS) 

provided p{r) > 0 , with /(p|r) = 0 otherwise. The averages {■)^ and E[-] are replaced by their 
analogues defined using densities (1671) and (f68]l . so we have the average of a quantity 0 at a point 
r given by: 

(</>)r = y'?^(>'>P)/(p|r)c?P, (69) 

and the spatial average of any r— dependent quantity (p given by: 

E[(j)] = j 0 (r)p(r)dr. ( 70 ) 

In the derivation leading up to the emittance decomposition ([32]l . the conditional expectation and 
conditional covariance of Appendix A are now evaluated for a fixed triple of spatial coordinates r, 
rather than for a fixed slice coordinate 2 ;. Using the same theorems, the results of Section 3 carry 
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over directly when one takes z r and replaces the averages (•)^ and E[-] with (1691) and (f70]l . Note 
that in this case we have: 

fix(r) = X, (71) 

and the covariance matrix at point r takes the form: 


S^(r) 



(72) 


Of the four contributions to the projected emittance given in (l,'12h . it follows from (I72p that two of 
these always vanish: 

4 = 4 = 0, (73) 

and the remaining two take the forms: 


4t = E[fTp.(r)] Var[/r^(r)], (74a) 

4 = Var[/ia;(r)] Var[/rp^(r)] - Cov[^^(r), ^pjr)]^ (74b) 

The contribution (I74al) is the intrinsic beam emittance due to intrinsic variations in the particle 
momentum at each point, while (|74bp is the correlated beam emittance due to spatially correlated 
variations in the particle momentum. 


It is convenient to define the transverse beam temperature [3] at a point r by using the momen¬ 
tum spread at r: 


ksTxir) 


mc^ 




(75) 


where ks is Boltzmann’s constant. In this way, we see that (I74ap is proportional to the spatial 
average of the transverse beam temperature times the square of the transverse rms beam size. 
Meanwhile, (|74bp is the projected emittance that the beam would have if it were cold —that is, if 
all particles at a given point were given a momentum equal to the mean momentum jjLp^ (r) at that 
point. 


6.2 Emittance decomposition using finite subpopulations 


In the previous sections, we described the beam by a continuous density / on the phase space: 
This is necessary for a rigorous discussion of slice moments and emittances, since these quantities 
are in general undefined when one considers a finite (discrete) particle distribution together with 
longitudinal slices of zero width. However, the decomposition (j32h is based on mathematical tools 
that are quite general, which produce corresponding results for the discrete-particle case. These 
results characterize the relationship between the rms emittances of the beam and the moments of 
its various subpopulations. 

For example, suppose that a finite beam of N particles is divided into M disjoint bins n = 1,..., M. 
These may be longitudinal bins of fixed or varying width, or bins based on some other criterion 
(such as particle energy); we assume only that the bins divide the phase space into non-overlapping 
regions. We set n = 0 for all points X in the phase space that lie outside the M bins, so that 
each phase space point corresponds to a unique index n. Then the value of this index defines a 
discrete random variable on the phase space, which plays exactly the role previously played by the 
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continuous random variable z. The conditional expectation and conditional covariance of Appendix 
A are now evaluated for fixed bin index n, rather than for hxed slice 2 ;. Using the same theorems, 
the results of Section 3 carry over directly when one takes z —>■ n and replaces the continuous 
averages (•), and E[-] with their discrete analogues. 


In particular, let Xj, j = 1,..., N denote the particle phase space coordinates, and let Nn denote 
the number of particles in bin n, with 

M 

Y^Nn = N. (76) 

n=l 

The averages in ([2|) and ([6]) now become: 


, Af .Nr, 

i=i ” i=i 

where the second sum is taken only over those particles that lie within bin n. Thus, {(t>)n denotes 
the bin average of the quantity (j)- We may therefore define the bin centroid vectors fin, the bin 
covariance matrices and the bin emittances, in the natural way. Finally, if 4>n is a quantity 
that varies from bin to bin, then we may dehne its average across bins as: 

N ^ 
i=i 

from which we may also define the covariance of two n—dependent quantities as in (jlOp . The 
horizontal projected emittance of the beam now takes the form: 

^R + ^int + 

where the four emittance contributions are given by (1321) after taking z ^ n. 


7 Conclusions 


The decomposition of a beam into longitudinal slices leads naturally to a corresponding decompo¬ 
sition for each of the projected emittances e^, ey, and ez into distinct nonnegative contributions, 
separating the effect of particle coordinate/momentum variation within each slice from the effect of 
particle coordinate/momentum variation between slices. It is expected that each of these contribu¬ 
tions is affected by distinct dynamical processes in a typical accelerator lattice, including nonlinear 
aberrations due to external focusing fields, transverse-longitudinal coupling due to dispersion, slice 
mismatch due to time-dependent focusing effects, wakefield-induced emittance growth, and a vari¬ 
ety of other effects. As a result, independent consideration of these four contributions may provide 
helpful diagnostic tools for studying the dynamical sources of projected emittance growth in typical 
accelerator systems. 
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A Appendix: Mathematical Theorems 


In this section, we describe several theorems that are useful in studying the relationships between 
slice and projected quantities. 


A.l Theorems relating to probability 

The following is a fundamental theorem regarding conditional probability m-m- We give the 
general result, and then prove the statement that is used in the text (|12m3ll . 

Suppose X, Y and Z are three random variables defined on the same probability space. We use 
standard notation for the expectation E[-] and covariance Cov[-, •] of these random variables. The 
expectation of X and the covariance of X and Y can be evaluated under the condition that Z takes 
on some fixed value Z = z. The resulting quantities are written as: 

E[X\Z = z], Cov[A,y|Z = z], (80) 

denoting the conditional expectation and the conditional covariance, respectively. As the value of 
Z varies, the functions E[A|Z] and Cov[X, Y\Z] are themselves random variables that depend on 
the value of Z. The relationships between these random variables are given by the law of total 
expectation: 

E[A] = E\E[X\Z]] (81) 

and the law of total covariance: 

Cov[A,y] = E[Cov[A,y|Z]] + Cov[E[A|y],E[y|Z]], (82) 

which hold provided E[A] and Cov[A, Y] are each finite, respectively. These results apply indepen¬ 
dently of whether the random variables X, Y, and Z are continuous, discrete, or a mixture of the 
two. 

Consider now the phase space coordinates X, with joint probability density given by the beam 
distribution function /(X). Let Z be the random variable corresponding to the coordinate z. Then 
(|8T]) states that, for any function on the phase space: 

{cf)=Emj. (83) 

In the special case that ffX.) = Xa, this corresponds to the result quoted in the main text (fT^ . 
We now prove (f83]i . 
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Pf / Taking the mean of {4>)^ across slices gives: 


E[(</')J = j Kz){(t>)^dz = j X{z) 


4>iz,C) fiC\z)dC 


dz 


4>iz,C)fiz,C)dC 


dz= / ,/.(X)/(X)dX = (0), 


where in the third equality we have made use of dH). □ 


(84) 


Next, for some indices a and b, let X be the random variable corresponding to the coordinate Xa, 
let Y be the random variable corresponding to the coordinate Xh, and let Z be the random variable 


corresponding to the coordinate z. Then we have, using the notation of the main text: 

CoY[X,Y\Z] = Yab{z), E[X\Z]=fia{z), E[Y\Z] = fik{z), (85) 

and ([82]) corresponds to the result given in the main text as (fT^ . a result that we now prove. 

Pf / We have from the definition of the beam covariance matrix S and ()83p that: 

= {XaXb) - {Xa){Xt) = E[{XaXb),] - E[(X,)J E[(Xfe)J. (86) 

Applying the definition of the slice centroid coordinates fJ-aiz) and fibiz) gives: 

= E[((Xa - fia{z)){Xb - Mz)))z + d'a{z)Hb{z)] - E[Ha{z)] E[nb{z)]. (87) 

Applying the linearity of the expectation value and the definition of the slice covariance matrix: 

= E[Safe(z)] + E[^j,a{z)fJ.b{z)] - E[iJ,a{z)] E[nb{z)], (88) 

which gives using (fTOjl that: 

Yab = E[Safe( 2 ;)] + Cov[fiaiz),fibiz)]. □ (89) 


The following theorem makes use of the notion of convexity. A convex subset of n—dimensional 
space M”' is a subset C with the property that, for all x and y in C and all 0 < t < 1, the point 
tx + (1 — t)y € C. (That is, the line segment joining any two points in C is itself contained in C.) 

If C is a convex subset of M”’, a convex function on C is a function / : C —>■ M such that, for all x 
and y in C and all 0 < t < 1 , 

f{tx + (1 - t)y) < t/(x) + (1 - t)/(y). (90) 

Given a convex function / : (7 —)■ M, a convex subset K <ZC will be called an f-affine subset if and 
only if, for all x and y in AT and all 0 < t < 1, equality holds in (I90|) . 

Jensen’s inequality. Let / be a convex function defined on a convex subset C of M"’, and let X 
be a random variable taking values in C. Suppose that the mean E[X] exists, in the sense that 
E[|X,|] < oo for each element j = 1,..., n. It then follows that E[X] € C, the real number E[/(X)] 
exists, and 

/(E[X])<E[/(X)]. (91) 

Eurthermore, equality holds in (|9ip if and only if there exists an /-affine subset K C C with the 
property that X G AT with probability 1 (a.e.). The proof of (|91ll can be found in |20j-|21j. The 
condition for equality follows from results described in m-m- 
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A.2 Theorems relating to matrices 


A real n x n symmetric matrix A is said to be positive semidefinite if x^Ax > 0 for all vectors 
X € M”, which is typically denoted by writing A > 0. The following results are standard, and the 
reader may refer to one of the references m-m for proofs. 

Property 1: If A is a positive semidefinite matrix, then the eigenvalues of A are nonnegative and 
therefore det(A) > 0. If det(A) ^ 0, then A~^ is also positive semidefinite. 

Property 2: If A is the covariance matrix associated with a vector of random variables, then A is a 
positive semidefinite matrix. 

Property 3: The set of n x n positive semidefinite matrices is a convex subset of the space of real 
n X n matrices. 

Pf / Let A and B he n x n positive semidehnite matrices, and let x € M"", so that; 

x^Ax > 0, x^Bx > 0. (92) 

For any 0 < t < 1, the quantity tA + (1 — t)B is a real n x n symmetric matrix and: 

x^{tA + (1 — t)B)x = t(x^Ax) + (1 — t){x'^Bx) > 0, (93) 

so that tA + (1 — t)B is positive semidefinite. □ 

The following inequalities can be found in m- 

Trace inequality. If A and B are n x n positive semidefinite matrices, then: 

ndet^^'^{AB) < ti{AB) < tr(A)tr(i?). (94) 

Equality always holds when n = 1. Suppose n > 1 and det(A) ^ 0. Then equality holds on the left 
iff i? = cA“^ for some c > 0, and equality holds on the right iff i? = 0, the zero matrix. 

Minkowski determinant inequality: If A and B are n x n positive semidefinite matrices, then 

det^/’^(A + B)> det^/*^ A + det^/” B. (95) 

Suppose det(A) / 0. Then equality holds iff S = cA for some c > 0. 

The following theorem can be viewed as a generalization of (|95ll to a (possibly infinite) ensemble 
of matrices. 

Jensen’s determinant inequality: Let i? be a random variable that takes its values in the set of n x n 
positive semidefinite matrices. Suppose the expectation value E[i2] exists, in the sense that; 

E[|i?ab|] < oo, a, 6 = l,...,n. (96) 

Then the matrix E[i2] is also positive semidehnite, and satishes: 

det^/" E[R] > E[det^/” R]. (97) 
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Furthermore, if the random matrix R has the property that det(i?) 7^ 0, then equality holds in (19711 
if and only if there exists a real-valued random variable c and a constant nxn matrix Rq such that 
R = cRq with probability 1. 

Pf / Let g be the function on the set of positive semidehnite matrices given by: 

g{A) = -dei^l^{A) (98) 

for each ^4 > 0. Given any 0 < t < 1, it follows from (|95p that for any matrices ^ > 0 and B > 0: 

g{tA + (1 - t)B) = - det^/^{tA + (1 - t)B) < - det^/^{tA) - det^/"((l - t)B) 

= -t dei^/^{A) - (1 - t) det^/"(S) = tg{A) + (1 - t)g{B). (99) 

Thus, the function g is convex on the set of positive semidehnite matrices. It now follows from 
Jensen’s inequality (I9TI1 that E[i?] > 0 and: 

5(E[i?]) < E[g{R)], (100) 

from which ()97ll immediately follows. To prove the equality condition, it follows from (|95l) that the 
only S'—afhne subsets in the set of positive semidehnite matrices with det(i?) 7^ 0 are those of the 
form {ci?o ■ c G [a, b]} for some hxed matrix Rq. Thus, equality holds in ([W]) if and only if 72 = cRq 
with probability 1. □ 


B Appendix: Nonnegativity 


It follows from (l8|) and (|18l) that both Tj^{z) and take the form of covariance matrices. By 
Property 2 of Appendix 6.2, each of these matrices is positive semidehnite, and we may apply all 
the theorems of that section. 

1. Mean slice emittance - Since E^{z) is positive semidehnite: 

Cxiz)"^ = detT;^(z) > 0. (101) 

Thus, €x{z) is a real quantity for each z, and so is the mean E[ex{z)]. Then we must have: 

ei = E[exiz)f > 0. (102) 

Purthermore, since ex{z) > 0 for all z, equality holds in (|102p if and only if ex{z) = 0 a.e. 

2. Mismatch emittance - Recall from (I26p that: 

det(E[S^(z)]) = ei + e|. (103) 

The 2x2 covariance matrix E^(z) is positive semidehnite for each z. It follows from Jensen’s 
determinant inequality ()97p that the matrix E[S'’*'(2;)] is also positive semidehnite and satishes: 

det^/2(EpV^^^]^ > E[det^/2(5^v^^^)j ^ E[ea;(x)] = e^. (104) 
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Thus, 


e| = det(E[S^(z)])-ei >0. (105) 

Suppose that the slice emittance is nonzero, ex{z) > 0. Then equality holds in (llOSp if and only 
if 'E^{z) = c{z)Ro a.e. for some real-valued function c{z) and a fixed matrix Rq. We can assume 
without loss of generality that det(i?o) = 1 (otherwise we may scale c appropriately). Thus, 
Rq = R{z) is exactly the slice Twiss matrix and c(z) = €x{z) is the slice emittance. Hence, equality 
holds in (11051) if and only if the slice Twiss functions a(z), (3{z), and 7 ( 2 ;) are constant a.e. 

3. Linear misalignment emittance - Let A and B be the 2x2 matrices defined in (I23h : 

H = E[S^(z)], B = A^. (106) 

Then A and B are each positive semidefinite. By the trace inequality (I94|) . we have: 

efnt = tr(H) tr(H) — ti{AB) > 0. (107) 

Suppose that the slice emittance is nonzero, €± 7 ^ 0. Then det(H) = 7 ^ 0, and equality 

in (jl07p holds if and only if B = = 0, the zero matrix. This implies that 'Var[^x{z)] = 

Yar:[g,p^{z)] = 0, so that ^x{z) = 'R[gLx{z)] and iip^{z) = E[/Xp^( 2 ;)] a.e. That is, the slice centroids 
are aligned. In this case, we also have = det(A^) = 0. 

4. The matrix is positive semidefinite, and therefore: 

ej = det(A^) > 0. (108) 

Suppose that strict inequality holds in ()108l) . so that e|| 7 I 0. By the results of Section 5.4, it 
follows that the slice centroids Hx{z), tip^{z) are not uniform in z, nor are they described by linear 
functions of the form ([55p . That is, one or both of fix{z), Hp^{z) possess a nonlinear dependence 
on the slice coordinate z. 


C Appendix: Slice mismatch parameter 


Let ((z) be the slice mismatch parameter defined in ((5(11) . Then C(z) > 1, and C('Z) = 1 if and only 
if R{z) = R. 

Pf / Since and E^(z) are positive semidefinite matrices and we have assumed that Cx > 0, 
ex{z) > 0, the matrices R = T,^{z)/ex and R{z) = {z)/ex{z) are also positive semidefinite. By 

Property 1 of Appendix A, this is also true of R~^. Furthermore, we have: 

det{R) = det(i2(z)) = 1. (109) 

It follows from the trace inequality (I94p that: 

C{z) = ^ tr(i?“^i?(z)) > det^'^^(i2“^i2(z)) = 1. (HO) 

In addition, equality holds if and only if R{z) = cR for some c > 0. However, we also have: 

1 = det(i?(z)) = det(ci?) = det(i?) = c^, (HI) 

so c = ±1. Thus, c = 1 and R{z) = i?. □ 
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